NSF PAR Search | NSF Public Access Repository

PDB-IHM: A System for Deposition, Curation, Validation, and Dissemination of Integrative Structures

https://doi.org/10.1016/j.jmb.2025.168963

Vallat, Brinda; Webb, Benjamin M; Zalevsky, Arthur; Tangmunarunkit, Hongsuda; Sekharan, Monica R; Voinea, Serban; Shafaeibejestan, Aref; Sagendorf, Jared; Hoch, Jeffrey C; Kurisu, Genji; et al (August 2025, Journal of Molecular Biology)

Structures of many large biomolecular assemblies are now being determined using integrative approaches. In these approaches, information derived from multiple experimental and computational methods is combined to compute three-dimensional structures of multi-protein complexes and other macromolecular machines. A standalone prototype data resource for integrative structures called PDB-Dev was built, based on recommendations of the Integrative and Hybrid Methods (IHM) Task Force of the Worldwide Protein Data Bank (wwPDB). This effort included developing data standards and software tools for collecting, curating, validating, visualizing, archiving, and disseminating integrative structures that span diverse spatiotemporal scales and conformational states. Mechanisms have been created to validate integrative structures based on the experimental data underpinning them. Building upon this foundational framework, PDB-Dev has been further expanded to handle large dynamic macromolecular systems and integrative structures that combine, for example, experimental restraints with atomic coordinates computed by machine learning algorithms. Data standards and supporting tools have also been extended to capture information about biomolecular dynamics, such as conformational transitions and related kinetic data derived from biophysical methods. Recently, PDB-Dev was unified with the PDB archive and rebranded as PDB-IHM (pdb-ihm.org), further promoting FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data stewardship for integrative structural biology.
Free, publicly-accessible full text available August 1, 2026
Describing and Sharing Molecular Visualizations Using the MolViewSpec Toolkit

https://doi.org/10.1002/cpz1.1099

Bittrich, Sebastian; Midlik, Adam; Varadi, Mihaly; Velankar, Sameer; Burley, Stephen K; Young, Jasmine Y; Sehnal, David; Vallat, Brinda (July 2024, Current Protocols)

With the ever‐expanding toolkit of molecular viewers, the ability to visualize macromolecular structures has never been more accessible. Yet, the idiosyncratic technical intricacies across tools and the integration complexities associated with handling structure annotation data present significant barriers to seamless interoperability and steep learning curves for many users. The necessity for reproducible data visualizations is at the forefront of the current challenges. Recently, we introduced MolViewSpec (homepage: https://molstar.org/mol-view-spec/, GitHub project: https://github.com/molstar/mol-view-spec), a specification approach that defines molecular visualizations, decoupling them from the varying implementation details of different molecular viewers. Through the protocols presented herein, we demonstrate how to use MolViewSpec and its 3D view–building Python library for creating sophisticated, customized 3D views covering all standard molecular visualizations. MolViewSpec supports representations like cartoon and ball‐and‐stick with coloring, labeling, and applying complex transformations such as superposition to any macromolecular structure file in mmCIF, BinaryCIF, and PDB formats. These examples showcase progress towards reusability and interoperability of molecular 3D visualization in an era when handling molecular structures at scale is a timely and pressing matter in structural bioinformatics as well as research and education across the life sciences.
Full Text Available
Announcing the launch of Protein Data Bank China as an Associate Member of the Worldwide Protein Data Bank Partnership

https://doi.org/10.1107/S2059798323006381

Xu, Wenqing; Velankar, Sameer; Patwardhan, Ardan; Hoch, Jeffrey C.; Burley, Stephen K.; Kurisu, Genji (September 2023, Acta Crystallographica Section D Structural Biology)

The Protein Data Bank (PDB) is the single global archive of atomic-level, three-dimensional structures of biological macromolecules experimentally determined by macromolecular crystallography, nuclear magnetic resonance spectroscopy or three-dimensional cryo-electron microscopy. The PDB is growing continuously, with a recent rapid increase in new structure depositions from Asia. In 2022, the Worldwide Protein Data Bank (wwPDB; https://www.wwpdb.org/) partners welcomed Protein Data Bank China (PDBc; https://www.pdbc.org.cn) to the organization as an Associate Member. PDBc is based in the National Facility for Protein Science in Shanghai which is associated with the Shanghai Advanced Research Institute of Chinese Academy of Sciences, the Shanghai Institute for Advanced Immunochemical Studies and the iHuman Institute of ShanghaiTech University. This letter describes the history of the wwPDB, recently established mechanisms for adding new wwPDB data centers and the processes developed to bring PDBc into the partnership.
Full Text Available
Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs

https://doi.org/10.1093/bib/bbac187

Sen, Neeladri; Anishchenko, Ivan; Bordin, Nicola; Sillitoe, Ian; Velankar, Sameer; Baker, David; Orengo, Christine (July 2022, Briefings in Bioinformatics)

Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques, such as RoseTTAFold and AlphaFold, we can predict the structure of proteins even in the absence of structural homologs. We modeled and extracted the domains from 553 disease-associated human proteins without known protein structures or close homologs in the Protein Data Bank. We noticed that the model quality was higher and the root mean square deviation (RMSD) lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could only be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein–protein interfaces and conserved residues in these predicted structures. We then explored whether the disease-associated missense mutations were in the proximity of these predicted functional sites, whether they destabilized the protein structure based on ddG calculations or whether they were predicted to be pathogenic. We could explain 80% of these disease-associated mutations based on proximity to functional sites, structural destabilization or pathogenicity. When compared to polymorphisms, a larger percentage of disease-associated missense mutations were buried, closer to predicted functional sites, predicted as destabilizing and pathogenic. Usage of models from the two state-of-the-art techniques provides better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models.
Full Text Available
IHMCIF: An Extension of the PDBx/mmCIF Data Standard for Integrative Structure Determination Methods

https://doi.org/10.1016/j.jmb.2024.168546

Vallat, Brinda; Webb, Benjamin M; Westbrook, John D; Goddard, Thomas D; Hanke, Christian A; Graziadei, Andrea; Peisach, Ezra; Zalevsky, Arthur; Sagendorf, Jared; Tangmunarunkit, Hongsuda; et al (March 2024, Journal of Molecular Biology)

Full Text Available
ModelCIF: An Extension of PDBx/mmCIF Data Representation for Computed Structure Models

https://doi.org/10.1016/j.jmb.2023.168021

Vallat, Brinda; Tauriello, Gerardo; Bienert, Stefan; Haas, Juergen; Webb, Benjamin M.; Žídek, Augustin; Zheng, Wei; Peisach, Ezra; Piehl, Dennis W.; Anischanka, Ivan; et al (July 2023, Journal of Molecular Biology)

Full Text Available
Modernized uniform representation of carbohydrate molecules in the Protein Data Bank

https://doi.org/10.1093/glycob/cwab039

Shao, Chenghua; Feng, Zukang; Westbrook, John D; Peisach, Ezra; Berrisford, John; Ikegawa, Yasuyo; Kurisu, Genji; Velankar, Sameer; Burley, Stephen K; Young, Jasmine Y (May 2021, Glycobiology)

Since 1971, the Protein Data Bank (PDB) has served as the single global archive for experimentally determined 3D structures of biological macromolecules made freely available to the global community according to the FAIR principles of Findability–Accessibility–Interoperability–Reusability. During the first 50 years of continuous PDB operations, standards for data representation have evolved to better represent rich and complex biological phenomena. Carbohydrate molecules present in more than 14,000 PDB structures have recently been reviewed and remediated to conform to a new standardized format. This machine-readable data representation for carbohydrates occurring in the PDB structures and the corresponding reference data improves the findability, accessibility, interoperability and reusability of structural information pertaining to these molecules. The PDB Exchange MacroMolecular Crystallographic Information File data dictionary now supports (i) standardized atom nomenclature that conforms to International Union of Pure and Applied Chemistry-International Union of Biochemistry and Molecular Biology (IUPAC-IUBMB) recommendations for carbohydrates, (ii) uniform representation of branched entities for oligosaccharides, (iii) commonly used linear descriptors of carbohydrates developed by the glycoscience community and (iv) annotation of glycosylation sites in proteins. For the first time, carbohydrates in PDB structures are consistently represented as collections of standardized monosaccharides, which precisely describe oligosaccharide structures and enable improved carbohydrate visualization, structure validation, robust quantitative and qualitative analyses, search for dendritic structures and classification. The uniform representation of carbohydrate molecules in the PDB described herein will facilitate broader usage of the resource by the glycoscience community and researchers studying glycoproteins.
Full Text Available
PDBx/mmCIF Ecosystem: Foundational Semantic Tools for Structural Biology

https://doi.org/10.1016/j.jmb.2022.167599

Westbrook, John D.; Young, Jasmine Y.; Shao, Chenghua; Feng, Zukang; Guranovic, Vladimir; Lawson, Catherine L.; Vallat, Brinda; Adams, Paul D.; Berrisford, John M; Bricogne, Gerard; et al (April 2022, Journal of Molecular Biology)

Full Text Available
The need to implement FAIR principles in biomolecular simulations

https://doi.org/10.1038/s41592-025-02635-0

Amaro, Rommie E; Åqvist, Johan; Bahar, Ivet; Battistini, Federica; Bellaiche, Adam; Beltran, Daniel; Biggin, Philip C; Bonomi, Massimiliano; Bowman, Gregory R; Bryce, Richard A; et al (April 2025, Nature Methods)

In the Big Data era, a change of paradigm in the use of molecular dynamics is required. Trajectories should be stored under FAIR (findable, accessible, interoperable and reusable) requirements to favor their reuse by the community under an open science paradigm.
Free, publicly-accessible full text available April 1, 2026
Announcing mandatory submission of PDBx/mmCIF format files for crystallographic depositions to the Protein Data Bank (PDB)

https://doi.org/10.1107/S2059798319004522

Adams, Paul D.; Afonine, Pavel V.; Baskaran, Kumaran; Berman, Helen M.; Berrisford, John; Bricogne, Gerard; Brown, David G.; Burley, Stephen K.; Chen, Minyu; Feng, Zukang; et al (April 2019, Acta Crystallographica Section D Structural Biology)

Full Text Available
